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Abstract 

Recent work on fault-tolerant quantum computation making use of topological error 
correction shows great potential, with the 2d surface code possessing a threshold error 
rate approaching 1% [1,2]. However, the 2d surface code requires the use of a complex 
state distillation procedure to achieve universal quantum computation. The colour code 
of [3] is a related scheme partially solving the problem, providing a means to perform 
all Clifford group gates transversally. We review the colour code and its error correcting 
methodology, discussing one approximate technique based on graph matching. We derive 
an analytic lower bound to the threshold error rate of 6.25% under error-free syndrome 
extraction, while numerical simulations indicate it may be as high as 13.3%. Inclusion 
of faulty syndrome extraction circuits drops the threshold to approximately 0.1%. 

1 Introduction 

The development of quantum error correcting codes in 1995 [4-6] is a major milestone in the 
journey towards realising a quantum computer that is able to outperform classical computers 
for large problems. Error correction allows the suppression of decoherence rate during a 
quantum algorithm, allowing one to perform lengthy calculations such as Shor's algorithm for 
prime number factorisation [7] with high fidelity results. The threshold theorem [8] states that, 
provided all gates are constructed with a failure rate below some threshold error rate, arbitrary 
length quantum computation can be achieved by employing quantum error correction with 
polylogarithmic overhead. 

The act of concatenation, the recursive grouping of logical qubits into successively higher 
level logical qubits, is one method to form codes with a threshold. However, this concatenation 
procedure creates non-local stabilisers involving an ever increasing number of physical qubits. 
As such, threshold error rates for codes formed in this manner suffer when one is limited to 
local interactions in few dimensions. For example, the 7-qubit Steane code has a threshold of 
Pth — 1-85 x 10~ 5 [9] when restricted to a 2d lattice with only nearest-neighbour couplings, 
and the Bacon-Shor code performs similarly, p t h = 2.02 x 10 -5 [10]. On the other hand, 
topological error correcting codes are designed with such locality constraints in mind and 
hence are particularly well adapted to these architectures. It has been shown that the 2d 
surface code [11] possesses a threshold error rate approaching 1% [1,2]. Additionally, use of 
defect braiding permits for long-range, multi-qubit interactions [12,13]. 
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The major drawback of the 2d surface code lies in its use of state distillation in performing 
S and T gates to achieve universal quantum computation. This method requires one be able 
to produce mass produce logical qubits approximating the states 

\Y) = (|0) + i |1))/V2, \A) = (|0) + e^ 4 \1))/V2. (1) 

Several approximations to one of these states are run through a distillation circuit to produce 
a single state better approximating that state. For example, the \Y) state distillation circuit 
(corresponding to implementing S gates) takes seven states approximating \Y) as input to 
produce a better approximate to \Y). Through several levels of iteration, one can arrive at 
the desired state with arbitrary accuracy. This procedure requires a large number of qubits, 
potentially several orders of magnitude greater than the rest of the computer, dedicated to 
producing such states. 

This motivates work towards finding a topological scheme which bypasses the issue of state 
distillation. In this paper we will consider the colour code of [3], which may be adapted to 
incorporate the desirable features of the surface code, whilst preserving its ability to implement 
S gates transversely [14]. Although the issue of state distillation remains for T gates, it is 
significant progress towards a distillation free topological code. At present what is missing is 
a determination of the error threshold of the colour code. 

Due to the similarity between the surface code and the colour code, and the relative sizes 
of their stabilisers, one can argue that the threshold for the colour code should be between 
10 -4 and 10 -3 , but this has yet to be demonstrated numerically. Recently, the threshold for a 
different colour code scheme featuring a honeycomb lattice has been found to be p t h = 0.109(2) 
by mapping to a random 3-body Ising model [15]. In these topological codes, one typically 
makes heavy use of classical computation in order to diagnose the sources of errors from a 
given syndrome. This task is non-trivial and one must have some efficient algorithm when 
applying these codes in real life. In order to determine the threshold for such a code, one in 
fact makes use of exactly this procedure, for it is necessary to restore the state back into a 
codeword and hence determine the the logical failure. 

Following our work on the surface code, we devise a method that may be applied in real 
life without modification to correct errors on the colour code based on finding approximate 
hypcrgraph matching solutions. Using this approach, we find the average time to failure 
of an encoded quantum memory under both error-free (ideal) and error-prone (non-ideal) 
syndrome extraction. We also determine the threshold analytically under ideal syndrome 
extraction by combinatoric arguments, without the burden of performing the hypergraph 
matching. We find the threshold for the 2d colour code under ideal syndrome extraction 
to be lower bounded by pth > 6.25%, and numerical simulations indicate it may be as high 
as pth — 13.3% comparable to the honeycomb colour code [15] and the surface code [2, 16] 
under similar circumstances. However, inclusion of the syndrome extraction circuits drops 
the threshold error rate to approximately 0.1%. 

This paper is organised as follows. Section 2 reviews the colour code and briefly discusses 
how error correction is performed, assuming that some logical state has been encoded into 
the surface. The discussion of logical qubits and logical operations is deferred until section 3. 
We return to the details of error correction in section 4. The simulation procedure and the 
results under non- ideal syndrome extraction are presented in section 5. Section 6 presents 
threshold estimates using two different methods under ideal syndrome extraction, firstly by 
simulation and then by combinatorics. 

2 Error correction on the 2d colour code 

Consider the 2d lattice of qubits arranged as shown in figure 1, assuming for now that the 
lattice extends indefinitely. Each plaquette is associated with two generators of the stabiliser 
group; the tensor product of the Pauli-A matrix, A, on the qubits around its perimeter, and 
the tensor product of the Pauli-Z matrix, Z , on those same qubits. Neighbouring plaquettes 
always share two qubits, ensuring all stabilisers commute. Assigned to each stabiliser is a 
colour, red, green or blue, such that each qubit belongs to exactly one A-stabiliser and one 
iv-stabiliser of each colour. The plaquette colours shown in figure 1 are not inherent to the 
system, merely a device to aid error correction. For the purposes of this paper, red stabilisers 
will always be synonymous with square stabilisers. 
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Fig. 1. 2d lattice of qubits for the colour code. White circles represent data qubits. Ancilla qubits 
located within the plaquettes are not shown. The stabiliser generators are the tensor products of 
X and Z on the qubits around each plaquette: X0X1X2X3, Z0Z1Z2Z3, X2 X3 X4 X5 X gX^XfiXtj, 
Z2Z->,Z^Z^Z^ZtZ%Z^, etc. The state \ip) is initialised to the simultaneous +1 cigcnstate of all 
the generators. A single X error, \ip) — » X will be observed as an eigenvalue change on the 
adjacent Z-stabilisers due to the commutation relation between the Pauli matrices. 



In the absence of errors, the state of the system is the simultaneous +1 eigenstate of each 
stabiliser. An X -error on one qubit causes the state to toggle between the ±1 eigenvalue states 
of the adjacent Z-stabilisers due to the commutation relations. Ancilla qubits located within 
each plaquette allow the eigenvalue to be measured locally. The configuration of eigenvalue 
changes measured forms the syndrome, from which the physical location of the errors can be 
inferred. Similarly Z-crrors are detected by measuring the eigenvalues of the A-stabilisers. 
Figure 2 illustrates more complex possible syndromes. 

Combinations of errors often conspire in such a way to conceal the intermediate eigenvalue 
flips of many plaquettes. These sets of errors, or error chains, may be seen as the primitives 
generating the observed syndrome instead of the independent errors on individual qubits. We 
will also regard single errors as error chains. The eigenvalue changes an error chain induces 
are its terminals. The colour of a terminal is the colour of the plaquette whose eigenvalue it 
alters. Error chains may have two same colour terminals (2-chains), three terminals with one 
of each colour (3-chains), or more terminals. Any chain having in excess of three terminals 
may be decomposed into superpositions of lower order chains, with some terminals overlapped 
(figure 2e). Indeed any 2-chain (figure 2b) may be derived from a pair of 3-chains, however 
they are treated as a primitive due to their relative simplicity. Since all observable syndromes 
are generated by superpositions of error chains, the identification of error chains is equivalent 
to locating errors. Given an arbitrary syndrome, error chain identification is carried out using 
algorithms from graph theory as follows. 

A matching, M, of an undirected graph, G, is a subgraph of G such that each node in M 
has exactly one incident edge. A perfect matching is a matching where all nodes in G belong 
to M. A minimum-weight perfect matching is an element from the set of perfect matchings, 
whose sum of edge weights is minimised. Many minimum-weight perfect matchings may be 
possible, in which case we recover just one. Polynomial-time matching algorithms for graphs 
exist, for example Edmonds' blossom algorithm [17-19]. Unfortunately, error correction in 
the colour code requires a hypergraph matching algorithm, for which efficient algorithms 
are not known. A hypergraph is a generalisation of a graph, where edges are promoted to 
hyperedges, sets of arbitrary numbers of vertices. The rank of the hypergraph is the maximum 
cardinality hyperedge. This will be discussed further in section 4, where our implemention of 
the hypergraph matching is detailed. For now, we assume that such an algorithm exists. 

Given a syndrome, the eigenvalue changes observed translate to nodes on a hypergraph. 
Any possible 2-chain required to generate a pair of terminals is represented by an edge joining 
the corresponding pair of nodes. Similarly a hyperedge is added for every possible 3-chain. 
A matching on this hypergraph then represents a corresponding set of error chains which 
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Fig. 2. Examples of syndromes produced by error chains (colour online). A red circle indicates a 
physical error on the given data qubit. A yellow circle at a site indicates a change in eigenvalue 
from its previous measurement, (a) a single error toggles three plaquette eigenvalues, (b) 2-chain 
generating two red terminals, (c) 2-chain generating two blue terminals, (d) 3-chain generates 
three different colour terminals, (e) many-terminal error chain can be decomposed into 2-chains 
and 3-chains with overlapping terminals. 



together will partially generate the syndrome, because each terminal belongs to exactly one 
error chain. Thus a perfect matching will reproduce the entire syndrome, allowing error 
correction to be performed. 

Many matchings can be found for a given syndrome. Since the syndrome arises from phys- 
ical errors which have a low probability of occurring, one should rig the weights of the edges 
and hyperedges such that the matching algorithm finds the matching of maximum likelihood. 
In general, the weight of an edge between some terminals should take into consideration all 
of the different possible error chains generating those terminals. For example, although the 
terminals in both cases shown in figure 3 are equidistant, figure 3a should be weighted as more 
probable than figure 3b purely because there are more length-4 error chains generating the 
former setup. However, in order to simplify matters, we will not take this into consideration; 
when many error chains are possible, we choose the graph edge to represent only one of the 
minimum-length possibilities. Under this approximation, the weight of an edge is taken to be 
the error chain length (which is proportional the logarithm of the probability), so that a chain 
formed by 2 errors is weighted identically to two independent single error chains. Although 
a minimum-weight perfect matching under this approximation will not necessarily be the 
matching of maximum likelihood, it will nevertheless reproduce the observed syndrome using 
the fewest number of errors; the hypergraph is constructed in a way that the weight-sum of 
its minimum-weight matching is the minimum number of errors required to reproduce the 
syndrome under ideal syndrome extraction. 

The question immediately arises as to what happens when one corrects along a path of 
qubits which did not suffer errors. The discussion requires a formal introduction to logical 
qubits and logical gates, so we only make some brief remarks. The distance of a code, d, 
is defined to be the minimum number of physical operations that must be applied to the 
physical qubits encoding some logical state in order to interchange between the two logical 
states without generating any observable syndrome. Under ideal syndrome extraction, a code 
can in principle correct any [^i^J error events. This holds true for the colour code when one 
corrects by minimum-weight hypergraph matching. If fewer than errors occur, error 

correction will always produce rings of operators instead of chains of operators connecting 
boundaries. The rings of operators are stabilisers of the code, thus the logical state is restored. 

As described, many-terminal error chains such as in figure 2e are not handled, resulting 
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Fig. 3. Two different syndromes caused by four errors. Case (a) is more probable than case (b) 
as it can be generated by more length-4 error chains, and one should weight it accordingly in the 
graph to recover the matching of maximum likelihood. Our simulations do not consider such fine 
details, instead we choose only one of the possibilities covering each case. Incorporating these 
alternatives could potentially increase the threshold. 




Fig. 4. Examples of syndromes on finite lattices. Dark plaquettes show the different colour 
boundaries, where eigenvalues are not measured, (a) 2-chain with masked red terminal, (b) 3- 
chain with masked red terminal, (c) 3-chain with two masked terminals, (d) the interface between 
a green and a blue boundary can also be considered a red boundary. 



in misidentification of the causes of syndromes; this particular syndrome can be produced by 
4 errors, however it currently appears to the error corrector as a 10 error event (generated by 
a pair of 2-chains) . One solution is to add these error chains into the hypergraph in the form 
of higher cardinality hyperedges. However, this not only increases the number of hyperedges 
exponentially, but it also increases the complexity of the matching algorithm. There is a more 
elegant solution which rests on the fact that the plaquette eigenvalue observed determines only 
the parity, not the exact number of terminals, at that location; one can artificially insert a pair 
of dummy nodes into the graph for a plaquette suspected of harboring overlapping terminals, 
exactly as if two terminals had been observed there. Misplaced dummy pairs can in the worst 
case be matched to one another by a weight-0 edge, whence matching continues as if the pair 
had not been introduced. Our simulations need not anticipate such instances: as we have 
access to the number of toggles at each plaquette, we introduce a pair whenever an eigenvalue 
changes twice or more. However, in a real implementation such information is not available 
so one should devise some clever algorithm to identify these overlaps and introduce these 
nodes only as required. The benefit from introducing dummy pairs is that one eliminates 
the need to match arbitrary rank hypergraphs (only rank-3 is necessary). Note that even the 
introduction of dummy pairs on every plaquette still incurs only a polynomial-time overhead. 

A physical implementation of this code must be done on a finite lattice, and hence the 
presence of boundaries which can hide otherwise visible terminals (figure 4). There are three 
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Syndrome Hypergraph Hypergraph matching 




Fig. 5. The hypergraph constructed from a the observed syndrome. Edge weights and boundary 
nodes have not been included. An edge between two different coloured terminals is coloured only 
to serve as a reminder that it denotes a 3-chain to the other coloured boundary. The hypergraph 
matching identifies a corresponding set of error chains which, once corrected, restores the state of 
the system into a codeword state. 



boundary colours. A red boundary is the interface shielding red terminals from being observed, 
so that, for example, a lone red node can be produced by a 2-chain from a red boundary. 
Green and blue boundaries are defined equivalently. Note that the interface between green 
and blue boundaries can also be considered a red boundary. The case of a 3-chain producing 
exactly two mixed-colour terminals and one masked terminal is easily accommodated for by 
adding an edge between those two terminals in the hypergraph, with the implicit prescription 
that this edge denotes generating these two terminals by joining them to the closest boundary 
by a 3-chain. Indeed, any edge or hyperedge between some arbitrary number of nodes can 
denote whatever means is necessary to generate exactly those terminals, independently of all 
other terminals observed. 

The single terminal case is more involved. Let G denote the hypergraph one constructs as 
described so far. The intent is to create beside each node, A, an associated boundary node, 
A' . Each node is joined to its own boundary node, corresponding to generating that terminal 
independently, for example by a 2-chain to the closest same colour boundary. However, this 
change by itself permits only one perfect matching: because the degree of every boundary 
node is exactly 1, every node must be matched to its boundary. If two regular nodes, A 
and B, are matched together, their respective boundary nodes, A 1 and B' , are unmatched 
and thus this is not a valid perfect matching. The resolution is to create a subhypergraph 
within the boundary nodes mirroring G with only weight-0 edges and hyperedges, so that 
when A is matched to B (or B and C by a hyperedge), then A' is readily matched to B' 
(B 1 and C") at no extra cost. While it is not true that the boundary node's matching will 
always reflect that of the regular node's, this is of no concern; error correction works with only 
edges and hyperedges in the matching involving at least one regular node. This extra change 
successfully deceives the matching algorithm into behaving as desired, and error correction 
on finite lattices may be performed. 

Figure 5 illustrates the error correction methodology described here. It is not strictly 
limited to this colour code, and may be adapted to other similarly oriented 2d topological 
schemes, such as the honeycomb colour code. Indeed, the error correction methodology of 
the 2d surface code is a specialisation of the same technique, whereby one forms and matches 
only rank-2 hypergraphs (ie. graphs). 

3 Logical qubits and logical gates 

So far we have described the error correction procedure on the colour code, which preserves 
some quantum state encoded onto the surface. In order to perform computation, logical qubits 
must be introduced. Following the original paper [3], each logical qubit is encoded onto a 
triangular lattice, as shown in figure 6. 
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Fig. 6. Triangular lattices permitting Clifford group gates to be performed transversely. Shown 
are distance 3, 5, 7 and 9 logical qubits, and the different coloured boundaries in dark. Also shown 
on the distance-9 lattice is an example of a logical- X (or logical-Z) operation; a completely masked 
3-chain. 



The logical- X operation is defined to be any 3-chain of X-operations on the data qubits 
connecting together all three colour boundaries, or any 2-chain from a qubit along one bound- 
ary to the opposite same coloured boundary. The symmetry between X and Z-stabilisers 
constrains the logical-Z operation to be the same 3-chains but with Z applied to the sites. 
Furthermore, these chains must be odd in length to yield the correct commutation relations. 
The minimum length of these chains defines the distance of the code. Therefore all distances d 
will be assumed to be odd hereafter. Note that none of these operations change the syndrome 
because all terminals are masked by boundaries. 

The nature of the X and Z stabilisers allows logical Hadamard operations to be performed 
transversely; when applying the Hadmard gate on every qubit, the identity Z = HXH implies 
that X-chains will transform to Z-chains, and vice-versa. 

The primary interest in using the colour code is its ability to perform transversal S gates. 
Since each plaquette comprises four or eight qubits, and neighbouring plaquettes share an 
even number of qubits, when an S gate is applied to every qubit, every |0l) state acquires the 
same phase, and similarly for every state. Furthermore, for all odd distance codes, an 
odd number of qubits is enclosed within the triangle lattice, ensuring that |0l) — > \0l) and 
|li>-±i|li>. ' ' ' ' 

Finally, controlled-not gates may be implemented transversely between two different sheets 
of triangular lattices; this operation will propagate X-chains from control qubit to target 
qubit, and Z-chains from target to control. 

It is possible to adapt the colour code so that the entire quantum computer shares a 
single code [14], creating logical qubits by introducing defects into the surface — contiguous 
regions where the eigenvalues of particular stabiliser generators are no longer measured, thus 
providing extra degrees of freedom — in a similar vein to surface codes [20]. In this manner, 
it is possible to recover many of the features of surface codes such as long range gates. In 
addition, one can isolate the triangular lattices above using a defect of each colour, examples 
of which are shown in figure 7, thus permitting transversal S gates. The details go beyond 
the scope of this paper. 
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Fig. 7. Examples of defects on the colour code. Green and blue defects share the same form. 
Logical qubits are formed by one defect of each colour, allowing the colour code to adopt many of 
the benefits of the 2d surface code, with the additional benefit of transversal S gates [14]. 



Hypergraph Alice's matching Bob's problem 




Fig. 8. Alice informs Bob exactly which nodes match as hyperedges. Using this information, 
Bob factors out those nodes from the hypergraph. Since he also knows the remaining nodes are 
matched by edges, he also eliminates the hyperedges, reducing the hypergraph down to a graph 
which can be matched in polynomial time. 



4 Hypergraph mimicry and pair assignment 

An essential requirement for the colour code error correction procedure as formulated in sec- 
tion 2 is the existance of an efficient rank-3 hypergraph weighted perfect matching algorithm. 
Whether an efficient algorithm exists is unclear; additional information from, for example, the 
specialised structure and the geometry may assist the problem. Recovering the minimum- 
weight hypergraph matching, which presumably produces better results than matchings of 
higher weight-sum, is not strictly necessary and may be relaxed to achieve an efficient error 
corrector. 

While we have formulated the error correction problem such that any syndrome can be 
represented as a hypergraph matching, let us clarify that we never match the hypergraph 
directly; they are strictly limited to discussion. Our procedure is to reduce the initial hyper- 
graph problem down to a simpler but approximate graph matching problem which we can 
solve. A solution to the graph problem may be mapped back to give a hypergraph matching 
and thus is a candidate for error correction, although it will not necessarily have as low a 
weight-sum as the minimum-weight hypergraph matching. 

Consider the following scenario: Alice and Bob each have the hypergraph ahead of time, to 
which Alice has found the minimum- weight matching with weight-sum Whyper- Alice wishes 
to communicate enough information to Bob, such that Bob can still reproduce her result 
in polynomial time. She can opt to send Bob all of the hyperedges in her matching. Bob, 
knowing the remaining nodes are matched by edges, can remove all hyperedges from the 
initial hypergraph to form a graph, which he can match efficiently to recover the remainder 
of Alice's matching (hgure 8). 

It is possible for Alice to communicate only two of the three nodes in each matched 
hyperedge, and still have Bob recover her minimum-weight hypergraph matching efficiently. 
To do so, Bob must collapse the indicated pairs into a single node. For each of Alice's matched 
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Hypergraph Alice's matching Bob's problem 




Fig. 9. Alice informs Bob which green and blue nodes together form hyperedges. In order for 
Bob to recover Alice's matching, he collapses the indicated blue and green nodes into a single 
node, so that hyperedges involving both blue and green nodes become edges, and all other edges 
are discarded. Since Bob also knows that all other nodes do not form triplets, the remaining 
hyperedges may also be discarded. Thus Bob's problem reduces to a graph matching problem. 




Fig. 10. Recall that edges connecting green and blue nodes in the hypergraph implies a connection 
to a red boundary by a 3-chain. Construction of the mimic graph begins by replacing each of these 
edges with a series of edges, and introducing four new nodes: g, p, p' , b. The newly introduced p 
act as g and b combined, and p' its boundary. The edge between p and p' carries the weight of 
the original edge. The g and b nodes are necessary ensure only g and b are not used in two error 
chains; once independently and once via the p node. 



hyperedges, Bob replaces the two nodes indicated by a single node, and the hyperedges 
connecting these two nodes in the original hypergraph are replaced by edges. Because he 
knows no other nodes are matched by hyperedges, any remaining hyperedges can be removed 
and thus Bob's problem reduces again to matching a graph (figure 9). 

Obviously we do not have the luxury of Alice's input, and so in both scenarios illustrated 
we would instead perform an exhaustive search over all of Alice's possible inputs. Such a search 
exhibits exponential behaviour which may be counteracted by introducing approximations; 
trying only a small subset of the potential inputs at the expense of finding the true minimum- 
weight hypergraph matching. In order to scale up this approximation to higher distance 
codes, we will devise a speculative algorithm following Alice's second technique for it has a 
smaller domain. 

Our method to assign together pairs of nodes is also achieved by matching a specialised 
mimic graph, which we now construct. Let us temporarily neglect all red nodes and build the 
hypergraph formed by just the green and blue nodes. Since we have not included red nodes, 
in reality this is simply a graph. In order to account for the 0(n r n g ni,) rank-3 hyperedges 
without actually introducing hyperedges, one must insert 0{n g nb) extra nodes. This can be 
done by substituting each edge connecting green and blue nodes with a series of edges, via 
four newly introduced intermediate nodes: g, p, p' , b (figure 10). 

This transformation in itself does not affect matchings on this graph; the choice of matching 
the edge g, b becomes a choice of matching an alternating set of edges. However, we have the 
new interpretation of p being the amalgam of g and b, and p' its boundary. The special nodes 
g and b are disables, which always have degree 2. They serve to ensure each node does not 
participate in two error chains, once as an individual and once as a pair. The interpretation 
of p as the pair formed by combining g and b allows us to add the hyperedges into the mimic 
graph; edges joining red nodes to pair nodes. As before, we join the respective boundaries 
together to fool the perfect matching condition (figure 11). 
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Fig. 11. Red nodes in the hypergraph are incorporated into the mimic graph. The hyperedge 
between r, g, b in the original hypergraph becomes, an edge between r and p in the mimic graph. 
The red boundary node, r' , is also joined to the pair boundary node, p' by a weight-0 edge. 



We still need to incorporate the case of generating red and green terminals by connecting 
to a blue boundary, and similarly for the red and blue terminals connecting to the closest 
green boundary. We can join red nodes to green nodes (and their respective boundary nodes) 
with the implicit assumption that those two terminals join to the closest blue boundary to 
form a 3-chain. However, for our approximate method it is best to minimise the degree of the 
nodes. We instead choose to insert additional blue nodes along the blue boundary plaquettes, 
used solely for forming these 3-chains. Whether or not the boundary actually masked a 
terminal is unimportant; an introduced node can at worst be matched to its own boundary 
node by a weight-0 edge and matching continues as if they had not been introduced, akin 
to the introduction of dummy pairs. Similarly, we introduce green nodes along the green 
boundary to account for joining red and blue terminals by a 3-chain. The procedure incurs a 
polynomial overhead and can be optimised to minimuse the number of extra nodes. 

This mimic graph emulates some of the properties of the hypergraph and can be matched 
efficiently. In particular, all matchings on the hypergraph are matchings on the mimic graph. 
However, the converse if not true; matchings of mimic graphs may not necessarily be translated 
into hypergraph matchings. This is due to an implicit demand for one of three particular 
patterns within the triplet region (figure 12), when including the red nodes. The matching 
algorithm knows nothing of such patterns, nor does the placement of weights assist it. Thus 
what we extract from the mimic graph matching itself is not a correction in general; only when 
it does not contain any malformed patterns is this true. However, we can use the matching 
as a choice for pair assignment to then produce a correction. We identify the matching of r 
to p and the subsequent matching of g to g (due to g having degree-2) as the assignment of 
r to g. 

There are two reasons for this choice. First, such combinations will always arise if a 
hypergraph matching were translated into a mimic graph matching. Second, one can often 
rotate the right hand side by a weight-0 alternating cycle, forming the desired patterns (figure 
13). An alternating cycle on a graph is simple cycle with edges alternating between included 
and excluded from the matching. The weight of an alternating cycle is the sum of the weights 
of edges not in the matching minus the weight of edges in the matching, so that the new 
matching will have weight W + w. One could correct the mimic matching by searching for 
lowly weighted alternating cycles, though this can be inefficient as one must be careful about 
breaking existing well-formed patterns. We observe that for these simple cycles, one can 
instead rematch with r and g collapsed into a single node, thus avoiding these complicated 
searches. 

From this choice of pair assignments, we collapse the rank-3 hypergraph with dummy nodes 
down to a trial graph, as Bob had done previously after receiving Alice's partial input. This 
trial graph may be matched efficiently to give a trial solution with some weight-sum Wtrial- A 
single choice of assignment can go amiss, so we take the solution with the lowest weight-sum 
over several initial assignments. In addition, there are six simple mimic graph constructions 
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Implicit patterns Malformed patterns 




Fig. 12. Left shows the three (partial) patterns implicitly demanded in the mimic matching if it is 
to always be interpretable as a correction. Malformed patterns may arise in the mimic matching, 
leaving this interpretation invalid, (a) extra nodes introduced were unused, (b) green and blue 
connect together to closest red boundary, (c) red, green and blue form 3-chain. 




Fig. 13. Malformed matchings can often be corrected by adding a weight-0 alternating cycle, shown 
in dashed lines. Adding an alternating cycle to a matching toggles the edges between included 
and excluded from the matching, yielding a new matching. 
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possible — one for each permutation of red, green and blue 6 — and, in general, the minimum- 
weight matching of each can have a different weight-sum. The variants with the highest 
weight-sum matching, Wmimic, are taken to yield matchings that closest approximate the 
minimum-weight hypergraph matching, and hence give better choices of pair assignment and 
trial solutions. Only the matchings from these variants are used as choices of pair assignment. 
This probabilistic method appears to yield near-optimal matchings even as the codeword 
distance increases. The reason we assume that higher weight-sum mimic graph matchings 
give better trial solutions is due to the relative ordering of the weight-sums: 

Wmimic < Whypcr < Atrial (2) 

In general, we do not know the weight-sum of the minimum-weight rank-3 hypergraph 
matching, IFhyper- However, as we have constructed the mimic graph to encompass possible 
hypergraph matchings, the mimic matching weight-sum is upper bounded by the minimum- 
weight hypergraph matching weight-sum. Extra freedom from the mimic matching not nec- 
essarily being a correction allows for its weight-sum to be lower: W m i m i c < Whyper- Similarly, 
after deciding upon a choice of assignment, many potential hypergraph matchings are dis- 
carded. Thus all trial matchings must have weight-sum Wtriai > Whyper- Notice that if 
Atrial = W^mimic, the heuristic has not introduced any additional errors over the minimum- 
weight hypergraph matching; the trial matching itself is a minimum- weight hypergraph match- 
ing. 

It is the number of uncertain matchings we use to gauge the quality of the approximation. 
We find that using up to 6 x 25 initial trials, this method has a 95% probability of recovering 
the minimum-weight hypergraph matching with certainty from a single time step with ^rp- 
crrors scattered for large lattices. Taking 6 x 50 trials does not significantly increase the 
probability. From the remaining uncertain cases, the final correction applied typically has a 
weight-sum only one greater than the mimic matching weight-sum, so that if it were not the 
hypermatching it at least falls very close. For our simulations, we have chosen 6 x 25 initial 
trials. 

5 Simulation results 

Simulations take place on the triangular lattices of figure 6. In our simulations we determine 
the average number of syndrome extraction cycles a quantum state encoded in a colour code 
endures before error correction results in a logical failure. The simulations trace only the 
propagation of A-errors throughout the machine. 

A single simulation instance proceeds as follows. First, the quantum computer is initialised 
perfectly in the simultaneous +1 eigenstate of every stabiliser generator. At each timestep, 
each qubit has a probability p of a memory error, then the syndrome information is extracted 
simultaneously over the entire surface. Red syndrome is extracted by preparing the ancilla 
positioned within each red stabiliser in the |0) state (|+) state for Z-syndromes) . Subsequently 
four controllcd-nots are directed inwards (outwards) from the surrounding data qubits to the 
ancilla, which are then measured in the Z-basis (A-basis). A coloured plaquette syndrome is 
extracted by first preparing a 4-qubit cat-state. Each qubit in the cat-state interacts with two 
distinct data qubits in that stabiliser, and is measured independently The four measurements 
together give the parity at that stabiliser. Further details of syndrome extraction are available 
in [14]. 

A memory error on a qubit is a probability p/3 of incurring cither an A, Z or Y = XZ 
error. Preparation of |0) and |+) states each have a probability p of resulting in the |1) and 
|— ) states respectively. Similarly, measurement in the X and Z-bases have a probability p of 
obtaining the incorrect result. Finally, the two-qubit gates have an equal probability p/15 of 
each of the 15 non-trivial tensor products of /, X, Y and Z. 



b For the mimic graph permutation we have worked with, we can crudely identify the red-to-pair edge in the 
mimic graph as the 3-chain. The mimic matching may not be a correction as it allows for blue terminals to be 
either not corrected, corrected once, or corrected twice (by a 2-chain and a 3-chain). Red and green terminals 
do not suffer such problems; they will always be corrected exactly once. Thus even the interchange of green 
and blue can produce different results. 
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physical error rate 

Fig. 14. The average time to failure of a quantum memory in the colour code. The vertical axis 
is proportional to real time. Error bars represent the uncertainty in the average time taken over 
a large number of instances. The asymptotic threshold error rate is approximately 0.1%. 



After each timestep, the syndrome information is used to correct the state and check 
for logical failure. Errors during syndrome extraction are taken into account by collating 
syndrome information over time, forming a 3d structure of eigenvalue changes. Error chains 
are permitted to span through time, with those segments denoting an error during syndrome 
extraction in the prior timestep. In our simulations, an error chain segment spanning through 
time is weighted equally to one of the same length spanning though space. In addition, we 
also collect one final syndrome ideally before error correction and checking for logical failure. 
Should error correction fail using this augmented syndrome information, we record the failure 
time. Otherwise, we continue to the next timestep recollecting this syndrome non-ideally. 
Using this procedure, the average life expectency of a logical state is shown figure 14. Also 
shown is the average lifespan of a single non-error-corrected qubit. 

There are two features of significance. Firstly, the gradients of the curves are observed to 
converge to the same value for low error rates for distance-5 and above. As with the surface 
code, correlated errors during the syndrome extraction cycle has caused a distance-<i code 
to correct fewer than the expected E{d) = [^-\ errors. Despite the distance-3 colour code 
being identical to the 7-qubit Steane code, the topological method of dealing with errors during 
syndrome extraction coupled to our rather simplistic weighting of error chain segments results 
in the colour code offering no benefits over unerror-corrected qubits. A combination of these 
correlated errors during syndrome extraction and our approximate error correction method 
are responsible for distance 5, 7 and 9 codes ultimately being able to correct the same number 
of errors. However, higher distance codes can be seen to improve upon lower distances; there 
are fewer combinations of two errors causing a distance-7 code to fail than for distance-5, 
and fewer yet again for distance-9. In light of this, we assert that higher distance codes will 
eventually be able to correct more errors under the error correction method presented. 

Secondly, the intersections of successive distance codes is highly mobile. Furthermore, the 
intersections move leftwards thus do not give a lower bound on the threshold. This in part 
may be due to the presence of boundaries of the lattice; our simulations of the toric code 
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and the surface code show similar trends [2], though admittedly are much less extreme. We 
will revisit this issue in section 6.2. Due to computing limitations, we are unable simulate 
higher distance codes to observe a convergence. The existing results indicate a threshold of 
approximately 0.1%. 

6 Ideal threshold error rate 

As the asymptotic threshold under realistic syndrome extraction is difficult to find, we seek 
to determine the threshold under ideal syndrome extraction. In this limit, there is no need 
to consider error chains spanning through time, making theoretical calculations much more 
accessible. We proceed in two directions: firstly by direct simulation, and secondly by counting 
the number of dangerous syndrome patterns. 

6.1 Direct simulation 

We already have the means to simulate the colour code logical failure rates under ideal syn- 
drome extraction. One point of note is that because syndrome information is always correct, 
we no longer need to consider error chains spanning through time; error chain segments span- 
ning through time are now weighted infinitely greater than those spanning through space. As 
such, each timeslice is corrected independently of all other timeslices. This implies that the 
total number of possible syndromes is finite, thus one can determine the logical error rate per 
timestep by summing over all possible dangerous syndrome combinations: 

Q 2 
P ( L d) (p )=J2Mk)p k (l-p) (Q - k \ P=^Po (3) 

fe=0 

Here po is the physical error rate, Q(d) is the number of data qubits, and Ad(k) is the 
number of dangerous syndromes resulting from k errors in the distance-c? code. The factor of 
| is due to our error model: only two of the three possibilities A, Y, Z may lead to logical- A" 
failures. For later convenience, we will make the change of variable k — > (F + k), where 
F(d) = ^±1 is the minimum number of errors required for a distance-c? code to fail under 
true minimum-weight hypcrgraph matching and ideal syndrome extraction. We will always 
reference a prefactor Ad(F + k) by its offset k from the expected leading order term. 

Q-f 

P ( l\po) =P F E MF + k)p\l - p)^- F ^-\ p = -po (4) 



The exact value for Ad(F + k) depends on the details of the matching algorithm used, and 
can be obtained by running the error corrector for each possibility. Since the total number 
of (F + fc)-error configurations to test is large, ( F + fc ), we approximate Ad(F + k) w ( F 9 fe )rfc 
by determining only the ratio of (F + fc)-error failures, rfe, from a large random sample. The 
results in the ideal syndrome extraction limit by direct simulation of the error correction 
procedure are shown in figure 15. The data points are obtained from simulations of the code 
at certain error rates, while the curves are given by equation 4, using our matching algorithm 
to estimate the prefactors Ad(F + k). We observe a threshold of p t h = 13.3%. 

6.2 Dangerous syndrome coverage 

One can place an upper bound on Ad{F+k), the number of dangerous syndromes as a result of 
F + k errors, under true minimum- weight hypcrgraph matching. We will assume hypergraph 
matching hereafter, in particular with regards to dangerous syndromes, and simplify refer 
to it as the matching. By construction of the hypergraph, the matching will correct for all 
E(d) = ^=1 error cases: Ad(F + k) = 0,V k < 0. Non-trivial contributions to the logical 
failure rate start from F(d) = ^±1. 
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Fig. 15. Average life expectancy of a quantum state under error free syndrome extraction. The 
minimum error rate at which successive distance codes intersect is taken to best approximate the 
asymptotic threshold, p th = 13.3%. 



Q-F 

p { L d \po)=P F E A d (F + k)p k (l~p)^- F ^ k , p=-p (5) 

k=0 

For an F error syndrome to cause failure, the F errors must all fall on a single length- 
en logical operator, Od] minimum-weight matching would find it preferable to correct the 
state by applying corrections on the remaining E qubits in O e i, thus performing the logical 
operation. Thus an upper bound to Ad(F) can be determined by enumerating all length-d 
logical operators, then choosing F qubits from each. 

It is tempting to make the statement that all dangerous syndromes as a result of F + k 
errors simply stem from a dangerous F-error syndrome and scattering a further k errors. 
Unfortunately this simplistic argument is not valid, as dangerous syndrome formed otherwise 
exist due to the presence of higher length logical operators. One can find counter-examples, 
and the difference in gradient between Ad(F) (figure 16) and Ad(F + k) (figure 17) further 
reinforce their presence. 

We conjecture that all dangerous F+k error syndromes are formed by some combination of 
F errors on qubits belonging to a single continuous length-(d+2A) error chain, then scattering 
a further k errors onto the remaining qubits, where A is a constant. A continuous error chain 
is one which may be derived placing a single error, generating some terminals, then shifting 
these terminals by the rules of figure 18. The rules are defined such that any logical operator 
can be formed in this way. In particular, because length- (d + 2 A) logical operators are formed 
in this way, the conjecture trivially holds true for the ( F + A)-error case. The conjecture is 
a constraint on the distribution of errors required for logical failure for a length-d code. For 
example, not any arbitrary placement of F errors will cause a code to fail, only very select 
combinations, namely those where the errors occur on qubits belonging to a single length-d 
logical operator. 
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One can upper bound the number of continuous length- (d + 2A) error chains. We start 
by placing an initial error on the Q qubits on the lattice, generating up to three terminals. 
Discarding backtracking, each terminal can be shifted by one of up to 6 rules (figure 18). 
Some care should be taken when moving red stabilisers as one has additional options that are 
not listed, such as those shown in figure 3. Similarly, minor modifications are necessary when 
a terminal lies next to a boundary. Regardless, for a continuous length-(d + 2A) error chain, 
because each shift is accomplished by placing down pairs of errors, one must make a total of 

+ A shifts, shared between the three terminals. There are approximately (^f^) ways to 
divides these shifts between the three terminals. Thus the maximum number of continuous 
length-(c? + 2A) error chains is: 

A^<Q(^)V- 1)/2+A (6) 
Our conjecture then bounds the number of dangerous syndromes as a result of F + k: 

w+ »><*,( < +»)(«;') m 

Since F = it follows that ( d+ p X ) is exponentially bounded, using ( x * 2 ) = ( x /2)?( x /2)! — 

d + 2A\ /d + 2A\ 2d+2X 



F ) ~ \F + \ / 

Substituting Ad{F + k) back into equation 5, then simplifying using the binomial expansion: 

Q J? 

= N d 2 d+2 V (10) 



For the case of A d {F), the dangerous .F-error syndromes are the various combinations of 
F errors along the lcngth-<i logical operators. For a given combination, the continuous error 
chain covering it is the logical operator from which it was derived. In this case, terminals 
must always step towards their boundary, narrowing the choices of shifting terminals down 
to approximately 4( d ~ 1 )/ 2 . We can calculate Ad{F) exactly for hypergraph matching by 
first enumerating all length-d logical operators, and then find all unique combinations of F 
errors. The logical operators themselves can be determined, for example, by deforming some 
given initial logical operator by the combinations of the stabiliser generators. These results are 
shown in figure 16, confirming our bound on Ad(F) displays the correct asymptotic behaviour. 

The higher order prefactors Ad(F + 2) and Ad(F + 4) are shown alongside the more 
general theoretical bound in figure 17. The counts shown in these graphs are taken from the 
preceeding simulations since enumerating quickly becomes difficult. The reason for choosing 
F + 2 is that in the colour code logical operators have length d + 4n, should only contribute 
to A(F + 2) and higher. Our results show that the A d (F + k) gradients agree with the 
combinatoric error rate, giving the initial conjecture further merit. Unfortunately, due to the 
approximate nature of our matching, higher length logical operators do in fact contribute to 
Ad(F + 1) and even Ad{F), so that they too display the same Q^ 1 ^/ 2 growth rate. 

It is not necessary to determine the prefactor in the logical error rate (equation 10) with a 
great deal of precision to calculate an asymptotic threshold; the complexity itself is sufficient. 
An asymptotic threshold is obtained by comparing the logical error rates of successive distance 
codes in the limit of large distance, d, whereupon all polynomial factors in d disappear: 
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Fig. 16. The leading contribution to the logical error rate, Aj(F), when using minimum-weight 
hypergraph matching grows exponentially with the distance of the code. These results were ob- 
tained by counting the length-ci logical operators and determining unique combinations of F errors 
lying along a single logical operator. 



(d+2) 

1 = i im 

Pl 

Pth = l^¥ = 6 - 25% (12) 

Figure 19a shows the average lifespan of a quantum memory over different distances using 
equation 10. However, we have already observed that the prefactors Aa(F + k) follow two 
different growth rates — 0(4( d - 1 )/ 2 ) for k < 1, and 0{<o^ d ~^/ 2 ) for k > 2 — hence one should 
not so readily simplify the equation. Applying equation 5 with the additional clamping of 
Ad(F + k) < ( F + fc ) gives figure 19b. Interestingly, it features the same fluctuating pseudo- 
thresholds as observed in the simulations. While we have noted that our simulation Ad(F) 
grows at the faster rate, it is presumed that the earlier terms are suppressed at different rates, 
giving rise to the same effect. 

These results rest on the initial conjecture, from which we have deduced the growth rates 
of Ad(F + k). Simulation results appear to follow the predicted growth rates. However, it 
remains to be rigorously proven that some constant A exists for all d, at least on this geometry, 
from which a lower bound to the threshold evidently follows. 



7 Conclusion 

We have described a general error correction procedure suitable for many 2d topological 
codes as a minimum-weight hypergraph perfect matching problem. We have also described 
an efficient but approximate method for matching rank-3 hypergraphs required for this colour 
code, which in principle may be used for other 3-colour codes. The method seeks solutions 
by constructing two graphs: one upper-bounded by the hypergraph solution, the other lower- 
bounded. When the two bounds meet, we identify that the approximation has introduced 
no errors. Thus this method can in principle be implemented as an initial pass before less 
efficient methods, which attempt to improve the code's performance, for example to ensure 
that a distance-d code reliably corrects errors. 

Combinatoric arguments presented here suggest that the asymptotic threshold error rate 
of the colour code to be lower bounded by p t h > 6.25% under error free syndrome extraction. 
Simulations using the approximate hypergraph matching method show that the lower bound 
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distance distance 
Fig. 17. The number of dangerous syndromes as a result of F+k errors is 

The data points were obtained using the approximate matching method outlined in section 4. 




Fig. 18. Rules of shifting terminals amongst same colour plaquettes. Each move requires exactly 
two errors, shown as the red qubits. Green and blue terminals share the same rules. Red terminals 
have additional multi-step rules, such as those shown in figure 3. Extra choices may be possible 
for terminals beside the boundary. 
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Fig. 19. (a) Lower bounds to the expected average time to failure when correcting by minimum- 
weight matching under ideal syndrome extraction, using equation 10 as the logical error rate. The 
asymptotic threshold is p t ^ = 6.25%. (b) The different growth rates between A^(F + k) for k < 1 
and k > 2 can be accounted for by using equation 5 to determine the logical error rate. This 
leads to fluctuating pseudo-thresholds. In this graph, we have also used Ad(F + k) = {J^ k ) when 
equation 7 exceeds ( F ^ fc ). 



on the threshold under these conditions may be as high as pth = 13.3%. Once faulty syn- 
drome extraction circuits are introduced, numerical simulations indicate that the threshold 
may fall to approximately 0.1%. Unfortunately, this begins to encroach on the realm of the 
concatenated codes, which do not need such complex error correction procedures and some 
of which bypass the use of state distillation. An efficient specialised matching algorithm for 
the colour code in the presence of boundaries may be possible, potentially raising these lower 
bounds on the threshold and improving its prospects. 
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